On Resampling for Statistical Confidentiality in Contingency Tables
Abstract
Resampling schemes, and especially the bootstrap method, were proposed as a subclass of perturbation methods to ensure statistical confidentiality in statistical databases. Later, a method based on bootstrapping was presented to achieve the more specific task of anonymising contingency tables. In this paper, we argue that the latter proposal is either inefficient from a computational point of view or insecure due to a high disclosure risk. For illustration, we show that this bootstrap-based procedure for contingency tables can be emulated and outperformed by a cell-oriented random perturbation method, whose complexity can be theoretically quantified. For a given disclosure risk, our cell-oriented perturbation method is more efficient. For a given computational complexity, our cell-oriented method exhibits a lower disclosure risk. More generally, it can be concluded that the very principle of resampling precludes the design of contingency table anonymisation schemes simultaneously providing security, computational efficiency and data quality.
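The contrast drawn in the abstract can be illustrated with a minimal sketch. It is not the procedure from the paper: the function names, the dictionary representation of the table, and the choice of rounded Gaussian noise are all assumptions made for illustration. The first function rebuilds a table by resampling individual records with replacement, so its cost grows with the number of records N. The second perturbs each cell directly, using noise whose variance N p (1 - p) matches the binomial marginal that a bootstrap resample induces on a cell with relative frequency p, so its cost grows only with the number of cells.

```python
import math
import random
from collections import Counter


def bootstrap_table(table, rng):
    """Resample the underlying records with replacement and re-tabulate.

    `table` maps a cell label to its count. Work is proportional to the
    number of records N, because every record is expanded and redrawn.
    """
    records = [cell for cell, count in table.items() for _ in range(count)]
    counts = Counter(rng.choices(records, k=len(records)))
    return {cell: counts.get(cell, 0) for cell in table}


def perturb_cells(table, rng):
    """Hypothetical cell-oriented perturbation (an assumption, not the
    paper's scheme): add rounded Gaussian noise to each cell independently.

    The noise variance N*p*(1-p) is a normal approximation to the
    per-cell marginal of a bootstrap resample. Work is proportional to
    the number of cells, not the number of records.
    """
    total = sum(table.values())
    perturbed = {}
    for cell, count in table.items():
        p = count / total if total else 0.0
        sigma = math.sqrt(total * p * (1.0 - p))
        noisy = count + round(rng.gauss(0.0, sigma))
        perturbed[cell] = max(0, noisy)  # counts cannot be negative
    return perturbed


if __name__ == "__main__":
    rng = random.Random(2024)
    table = {("A", "x"): 40, ("A", "y"): 3, ("B", "x"): 0, ("B", "y"): 57}
    print(bootstrap_table(table, rng))
    print(perturb_cells(table, rng))
```

In this sketch the bootstrap version preserves the grand total exactly, while the cell-wise version does not. Both leave empty cells empty, which hints at why small or zero counts remain a disclosure concern under either approach.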
Similar resources
Plain Answers to Several Questions about Association/Independence Structure in Complete/Incomplete Contingency Tables
In this paper, we develop some results based on the Relational model (Klimova et al., 2012), which permits a decomposition of the logarithm of expected cell frequencies under a log-linear type model. These results imply plain answers to several questions in the context of the analysis of contingency tables. Moreover, determination of design matrix and hypothesis-induced matrix of the model will be discusse...
Partial Association Components in Multi-way Contingency Tables and Their Statistical Analysis
In analyses of contingency tables made up of categorical variables, the study of the relationship between the variables is usually the major objective. So far, many association measures and association models have been used to measure the association structure present in the table. Although the association measures merely determine the degree of strength of association between the study varia...
On Some Research Issues in Multilevel Database Security
Integrity and confidentiality have traditionally been treated as distinct objectives. In the arena of MLS Operating Systems this dichotomy has not been a problem. The integrity demands placed on an OS do not significantly conflict with multilevel confidentiality. However, for MLS DBMS's the situation is quite different. Integrity and confidentiality are often in direct conflict, as one seeks to retain...
Analysis of Dynamic Longitudinal Categorical Data in Incomplete Contingency Tables Using Capture-Recapture Sampling: A Case Study of Semi-Concentrated Doctoral Exam
Abstract. In this paper, dynamic longitudinal categorical data and estimation of their parameters in incomplete contingency tables are evaluated. To apply the proposed method, a study has been conducted on the data of the semi-concentrated doctoral exam of the National Organization for Educational Testing (NOET). The results of studies such as the obtained confidence intervals and calculating t...
Security Models
Even if we limit ourselves to models of confidentiality, there are two related, but distinct, senses of the term security model in the computer security literature [McL90b]. In the more limited use of the term, a security model specifies a particular mechanism for enforcing confidentiality, called access control, which was brought over into computer security from the world of documents and safes. ...